CDS

Accession Number TCMCG075C00798
gbkey CDS
Protein Id XP_017979950.1
Location join(3444681..3444896,3445045..3445282,3445425..3445615,3445801..3446036,3446260..3446503,3446629..3446736,3447024..3447146)
Gene LOC18611267
GeneID 18611267
Organism Theobroma cacao

Protein

Length 451aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018124461.1
Definition PREDICTED: formamidase [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category C
Description formamidase
KEGG_TC -
KEGG_Module -
KEGG_Reaction R00524        [VIEW IN KEGG]
KEGG_rclass RC02432        [VIEW IN KEGG]
RC02810        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K01455        [VIEW IN KEGG]
EC 3.5.1.49        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00460        [VIEW IN KEGG]
ko00630        [VIEW IN KEGG]
ko00910        [VIEW IN KEGG]
ko01200        [VIEW IN KEGG]
map00460        [VIEW IN KEGG]
map00630        [VIEW IN KEGG]
map00910        [VIEW IN KEGG]
map01200        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCTCCAACTCCAAGACTGGTGGTGCCAATAGACTTGAAGAAGAAGCCATGGGAACAAACTCTGCCTCTTCACAACCGTTGGCACCCTGAAATACCCTCGGTAGCGGATGTTGAAGTCGGTGAGGTCTTTAGAGTAGAGATGGTAGACTGGACTGGAGGTATCATAAAAGATGATGATTCCGCAACTGATGTAAAATTTATAGATCTCTCCACTGTCCATTATCTCAGTGGGCCAATTAGAGTTGTAGACAAGGATGGCATACCTGCCAAGCCAGGTGATCTTCTTGCAGTTGAAATATGTAACCTGGGTCCTTTGCCGGGGGACGAATGGGGTTATACAGCAACGTTTGACAGAGAAAATGGAGGAGGTTTCTTGACAGACCATTTTCCATGCGCAACTAAAGCTATTTGGTATTTTGAAGGAATTTATGCCTACTCTCCCCATATACCAGGGGTGAGATTTCCGGGTTTGACTCATCCTGGAATAATTGGGACTGCACCATCAATGGAACTTCTGAATATATGGAATGAAAGGGAAAGAGAAGTAGAAGAAAATGGCCATAAGTCCCTAAAACTATGTGAAGTTTTGCATTCAAGACCGTTGGCAAACCTTCCATCAACCAAAGGCTGCCATTTAGGAAAGATAACTAAGGGGACTGCTGAATGGGAAAAGATTGCTAAGGAAGCGGCAAGGACTATTCCTGGAAGAGAAAATGGAGGAAATTGTGACATCAAGAACCTTAGTAGAGGTTCGAAAATATATCTTCCAGTGTTCGTAGAAGGAGCAAATTTCAGCACCGGTGACATGCATTTTTCTCAAGGTGATGGTGAAGTTGCCTTCTGCGGAGCAATTGAGATGAGTGGTTTTCTAGAGCTAAAGTGCGAAATCATAAGAGGTGGAATGAAAGAGTACCTTACTCCAATGGGGCCAACCCCACTTCATGTAAACCCAATCTTCGAGATTGGCCCAGTTGAACCCAGATTCTCAGAATGGCTGGTGTTTGAGGGGATAAGCGTGGATGAGACCGGAAGGCAACATTTCCTTGATGCAAGTGTTGCATATAAACGTGCAGTACTCAATGCTATTGACTACCTCTCTAAGTTTGGGTACTCCAAAGAACAGATATACCTTCTGCTATCCTGCTGCCCATGTGAAGGAAGGATATCTGGAATTGTGGATTCACCAAATGCTCTTGCAACTCTTGCAATTCCAACTGCTATCTTTGACCAGGACATTCGTCCAAAAACTGGAAAGGTACCAGTAGGGCCTCGGCTAGTGAGAAAACCAGATGTCTTAAGATGCACTTACGATGGAAATCTTCCCACAACAAAGAACCCAGCTGCCTTGATGTAA
Protein:  
MAPTPRLVVPIDLKKKPWEQTLPLHNRWHPEIPSVADVEVGEVFRVEMVDWTGGIIKDDDSATDVKFIDLSTVHYLSGPIRVVDKDGIPAKPGDLLAVEICNLGPLPGDEWGYTATFDRENGGGFLTDHFPCATKAIWYFEGIYAYSPHIPGVRFPGLTHPGIIGTAPSMELLNIWNEREREVEENGHKSLKLCEVLHSRPLANLPSTKGCHLGKITKGTAEWEKIAKEAARTIPGRENGGNCDIKNLSRGSKIYLPVFVEGANFSTGDMHFSQGDGEVAFCGAIEMSGFLELKCEIIRGGMKEYLTPMGPTPLHVNPIFEIGPVEPRFSEWLVFEGISVDETGRQHFLDASVAYKRAVLNAIDYLSKFGYSKEQIYLLLSCCPCEGRISGIVDSPNALATLAIPTAIFDQDIRPKTGKVPVGPRLVRKPDVLRCTYDGNLPTTKNPAALM